A Closed-loop Multimode Variable Bit Rate Characteristic Waveform Interpolation Coder

نویسندگان

  • Jing Wang
  • Jingming Kuang
  • Shenghui Zhao
چکیده

A variable bit rate characteristic waveform interpolation (VBR-CWI) speech codec with about 1.86kbps average bit rate which combines closed-loop multimode techniques is presented in this paper. Each kind of characteristic waveform (CW) surface is regarded as only rapidly evolving waveforms (REWs), only slowly evolving waveforms (SEWs) or mixed REWs plus SEWs in different cases of CWs evolving performance. A cost criterion based on weighted signal-to-noise (WSNR) value in the spectral domain is used to make the mode selection. Experiments show that the proposed closed-loop multimode VBR-CWI coder has reduced the average bit rate markedly and improved the synthesis speech quality to some extent compared to the original fixed bit rate coder. Further research can be done in order to have a more accurate perceptual objective quality measurement instead of WSNR and there is also need to pay attention to computational complexity of closed-loop method in real-time applications.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A new low bit rate speech coder based on intraframe waveform interpolation

A new characteristic waveform (CW) interpolation coder is proposed in this paper. In the proposed coder, two characteristic waveforms are extracted from LPC residual signal at each frame. The Waveform Interpolation (WI) is operated within the frame. In the novel WI, variable dimension vector quantization (VDVQ) and power vector quantization are proposed and the low frequency band (LFB) and high...

متن کامل

Source controlled variable bit-rate speech coder based on waveform interpolation

This paper describes a source controlled variable bit-rate (SCVBR) speech coder based on the concept of prototype waveform interpolation. The coder uses a four mode classification : silence, voiced, unvoiced and transition. These modes are detected after the speech has been decomposed into slowly evolving (SEW) and rapidly evolving (REW) waveforms. A voicing activity detection (VAD), the relati...

متن کامل

A Low-complexity Improved WI Speech Coding at 2kbps

The waveform interpolation (WI) speech coding presents a good performance at low bit rate. However, the algorithm has a very high complexity in computation. In this paper, a low-complexity improved waveform interpolation speech coder at 2kbps is proposed. The improved coding scheme has greatly reduced the computational complexity and improved the reconstructed speech quality by using various te...

متن کامل

Multimode Tree Coding of Speech with Perceptual Pre-weighting and Post-weighting

A low delay and low complexity speech coder based on Multimode Tree Coding is proposed. In our Multimode Tree Coder, a simple mode classification method along with frame energy are used to classify the input speech frames into five different modes. Each mode is coded at a suitable bit-rate using a Tree coder with computationally efficient perceptual error pre-weighting and post-weighting filter...

متن کامل

Very low rate speech coding using temporal decomposition and waveform interpolation

In very low rate coding the aim is to accurately represent speech characteristics as efficiently as possible. High coding gains for the spectral features can be achieved through the use of temporal decomposition. Waveform interpolation coders accurately represent the excitation using characteristic waveforms (CWs) extracted at a constant rate. In this paper, the two approaches are combined into...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006